An Apache open source project, Hadoop stores huge amounts of data in safe, reliable storage and runs complex queries over that data efficiently. It is at the core of many of the most popular Big Data tools, so mastering Hadoop helps you get the best out of those tools and better insight from your data. Elton Stoneman’s Hadoop Succinctly explains how Hadoop works and what goes on in the cluster, and demonstrates how to move data in and out of Hadoop and how to query it efficiently. It also walks through a Java MapReduce example, illustrates how to write the same query in Python and .NET, and discusses the wider Hadoop ecosystem.
Introducing Hadoop
Getting Started with Hadoop
HDFS—The Hadoop Distributed File System
YARN—Yet Another Resource Negotiator
Hadoop Streaming
Inside the Cluster
Hadoop Distributions
The Hadoop Ecosystem
ISBN: 978-1-64200-113-6
Published: September 19, 2016
Pages: 83